Systems and methods for broadcasting an audio or visual alert that includes a description of features of an ambient object extracted from an image captured by a camera of a doorbell device

ABSTRACT

Systems and methods for broadcasting an audio or visual alert that includes a description of features associated with a person extracted from an image captured by a camera are provided. Such systems and methods can include the camera capturing the image when the person is within a field of view of the camera and a processor receiving the image from the camera, processing the image with an artificial intelligence model to identify and extract details of the features associated with the person, and initiating a broadcast of the audio or visual alert by an alert device associated with the camera, wherein the audio or visual alert can include the description of the features associated with the person.

RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 16/860,244, filed Apr. 28, 2020, the content of which is incorporated herein by reference.

FIELD

The present invention relates generally to doorbell devices. More particularly, the present invention relates to systems and methods for broadcasting an audio or visual alert that includes a description of features of an ambient object extracted from an image captured by a camera of a doorbell device.

BACKGROUND

A known doorbell device can initiate a broadcast of a customized audio alert that is pre-generated and associated with a known person or object in response to identifying the known person or object in an image captured by a camera of such a doorbell device. However, known doorbell devices do not generate customized and detailed audio and/or visual alerts for unknown persons or objects.

In view of the above, there is a need and an opportunity for improved systems and methods.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of a doorbell device in accordance with disclosed embodiments;

FIG. 2 is a block diagram of a system in accordance with disclosed embodiments; and

FIG. 3 is a flow diagram of a method in accordance with disclosed embodiments.

DETAILED DESCRIPTION

While this invention is susceptible of an embodiment in many different forms, specific embodiments thereof will be described herein in detail with the understanding that the present disclosure is to be considered as an exemplification of the principles of the invention. It is not intended to limit the invention to the specific illustrated embodiments.

Embodiments of the claimed invention can include systems and methods for broadcasting an audio or visual alert that includes a description of features of an ambient object extracted from an image captured by a camera of a doorbell device. In some embodiments, the doorbell device can include the camera and a processor, and in some embodiments, the processor can (1) receive the image from the camera, (2) process the image with an artificial intelligence model to identify and extract details of the features of the ambient object, and (3) initiate a broadcast of the audio or visual alert by an alert device associated with the camera.

In some embodiments, the audio or visual alert can include the description of the features of the ambient object. For example, in some embodiments, the processor can use the artificial intelligence model to extract the details of a person depicted in the image, such as a height of the person, an eye color of the person, a description of the person's clothing, a physical build of the person, any textual indicators on the person or the person's clothing, and other features as would be understood by a person of ordinary skill in the art. Additionally or alternatively, in some embodiments, the processor can use the artificial intelligence model to extract the details of an inanimate or animate object, other than people, depicted in the image. For example, when the object is a vehicle, the details extracted from the image can include a license plate of the vehicle, a make of the vehicle, a model of the vehicle, and other features as would be understood by a person of ordinary skill in the art.
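By way of illustration only, the following minimal sketch shows one way that details extracted for a person could be assembled into the description carried by the alert. The `PersonFeatures` container, the `describe_person` function, and the exact wording of the output are hypothetical names and choices introduced for this example; the disclosure does not prescribe any particular data format or phrasing.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class PersonFeatures:
    """Hypothetical container for details an AI model might extract."""
    height: Optional[str] = None
    eye_color: Optional[str] = None
    clothing: Optional[str] = None
    build: Optional[str] = None
    text_indicators: List[str] = field(default_factory=list)


def describe_person(features: PersonFeatures) -> str:
    """Compose a description from whichever details are present."""
    parts = []
    if features.build:
        parts.append(f"{features.build} build")
    if features.height:
        parts.append(f"about {features.height} tall")
    if features.eye_color:
        parts.append(f"{features.eye_color} eyes")
    if features.clothing:
        parts.append(f"wearing {features.clothing}")
    if features.text_indicators:
        quoted = ", ".join(f'"{t}"' for t in features.text_indicators)
        parts.append(f"visible text {quoted}")
    if not parts:
        # No details were extracted, so fall back to a bare notice.
        return "A person is at the door."
    return "A person is at the door: " + ", ".join(parts) + "."
```

In this sketch, any detail that the model does not return is simply left out of the description, so a partial extraction still yields a usable alert.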

In some embodiments, the processor can process the image to determine whether the ambient object is a known object or a generic object, and when the ambient object is the known object, the audio or visual alert can include a sound and/or a visual indicator associated with the known object and, accordingly, fail to include the description of the features of the ambient object. Additionally, in some embodiments, when the processor fails to identify the ambient object as the known object, the processor can classify the ambient object as the generic object.
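As a non-limiting sketch of the known object versus generic object branching described above, the following example assumes that an upstream recognition step yields an identifier (or `None` when nothing is recognized) and that known objects are stored in a mapping from identifier to a pre-associated sound. The names `classify`, `select_alert`, and the dictionary layout are hypothetical and are not drawn from the disclosure.

```python
from typing import Dict, Optional


def classify(object_id: Optional[str], known_objects: Dict[str, str]) -> str:
    """Return "known" for a recognized identifier, otherwise "generic"."""
    if object_id is not None and object_id in known_objects:
        return "known"
    # Failing to identify the object as known classifies it as generic.
    return "generic"


def select_alert(
    object_id: Optional[str],
    known_objects: Dict[str, str],
    description: str,
) -> Dict[str, Optional[str]]:
    """Choose the alert content based on the classification."""
    kind = classify(object_id, known_objects)
    if kind == "known":
        # Known object: use its associated sound and omit the description.
        return {"classification": kind,
                "sound": known_objects[object_id],
                "description": None}
    # Generic object: the alert carries the extracted description.
    return {"classification": kind, "sound": None, "description": description}
```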

In some embodiments, the alert device can include an audio device within a region outside of which the doorbell device is located, and in these embodiments, the alert device can communicate with a transceiver of the doorbell device via a wireless network or a hardwired connection. Additionally or alternatively, in some embodiments, the alert device can be part of, housed in, and/or integral with the doorbell device and communicate with the processor directly. Additionally or alternatively, in some embodiments, the alert device can include a mobile device of an authorized user of the region outside of which the doorbell device is located, and in these embodiments, the alert device can communicate with the transceiver of the doorbell device via the wireless network.
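The pairing of alert device types with communication paths described above can be summarized in a short sketch. The enumeration `AlertDeviceKind`, the link labels, and the function `allowed_links` are hypothetical names used only for illustration.

```python
from enum import Enum
from typing import Set


class AlertDeviceKind(Enum):
    INTEGRATED = "integrated"      # part of or integral with the doorbell device
    REGION_AUDIO = "region_audio"  # audio device within the region
    MOBILE = "mobile"              # mobile device of an authorized user


def allowed_links(kind: AlertDeviceKind) -> Set[str]:
    """Return the communication paths described for each alert device type."""
    if kind is AlertDeviceKind.INTEGRATED:
        return {"direct"}                 # communicates with the processor directly
    if kind is AlertDeviceKind.REGION_AUDIO:
        return {"wireless", "hardwired"}  # via the transceiver
    return {"wireless"}                   # mobile device, via the transceiver
```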

In some embodiments, the camera can capture the image when the camera is activated and the ambient object is located within a field of view of the camera. In these embodiments, a proximity detector can activate the camera when the ambient object is detected within a detection range of the proximity detector. For example, in some embodiments the proximity detector can be part of, included in, and/or integral with the doorbell device and communicate with the camera and/or the processor directly. Additionally or alternatively, in some embodiments, the proximity detector can communicate with the transceiver of the doorbell device via the wireless network or the hardwired connection.
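The activation behavior described above can be sketched as follows, assuming for illustration that the proximity detector reports a distance in meters and that the camera is driven by a caller-supplied capture function. The `Camera` and `ProximityDetector` classes and their method names are hypothetical.

```python
from typing import Callable, Optional


class Camera:
    """Hypothetical camera that captures only after being activated."""

    def __init__(self, capture_fn: Callable[[], bytes]) -> None:
        self._capture_fn = capture_fn
        self.active = False

    def activate(self) -> None:
        self.active = True

    def capture(self) -> Optional[bytes]:
        # An inactive camera produces no image.
        return self._capture_fn() if self.active else None


class ProximityDetector:
    """Hypothetical detector that activates the camera within its range."""

    def __init__(self, detection_range_m: float, camera: Camera) -> None:
        self.detection_range_m = detection_range_m
        self.camera = camera

    def on_reading(self, distance_m: float) -> bool:
        in_range = 0.0 <= distance_m <= self.detection_range_m
        if in_range:
            self.camera.activate()
        return in_range
```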

In some embodiments, a cloud server that is remote from the doorbell device and the region outside of which the doorbell device is located can execute one or more of the steps that would otherwise be executed by the processor as described above. For example, in some embodiments, the cloud server can receive the image from the camera and process the image with the artificial intelligence model to identify and extract the details of the features of the ambient object. Additionally or alternatively, in some embodiments, the cloud server can initiate the broadcast of the audio or visual alert. Additionally or alternatively, in some embodiments, the cloud server can process the image to determine whether the ambient object is the known object or the generic object, and in these embodiments, when the cloud server fails to identify the ambient object as the known object, the cloud server can classify the ambient object as the generic object.
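One possible way to divide the image processing between the processor and the cloud server is sketched below. The sketch assumes that each location exposes an extraction function that accepts image bytes and returns a dictionary of details; the `extract_details` function, the `use_cloud` flag, and the fallback behavior are hypothetical choices made for this example.

```python
from typing import Callable, Dict, Optional

Extractor = Callable[[bytes], Dict[str, str]]


def extract_details(
    image: bytes,
    local: Optional[Extractor] = None,
    cloud: Optional[Extractor] = None,
    use_cloud: bool = False,
) -> Dict[str, str]:
    """Run extraction at the cloud server or at the device processor."""
    if use_cloud and cloud is not None:
        return cloud(image)
    if local is not None:
        return local(image)
    if cloud is not None:
        # No local model is available, so fall back to the cloud server.
        return cloud(image)
    raise ValueError("no extractor available")
```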

FIG. 1 is a block diagram of a doorbell device 20 in accordance with disclosed embodiments, and FIG. 2 is a block diagram of a system 200 in accordance with disclosed embodiments. As seen in FIG. 1, in some embodiments, the doorbell device 20 can include a processor 22 and a camera 24 and, in some embodiments, a proximity detector 26, an alert device 28, and/or a transceiver 30. As seen in FIG. 2, in some embodiments, the system 200 can include the doorbell device 20 and a cloud server 32 such that the doorbell device 20 can be located outside of a region R, such that the transceiver 30 can communicate with the cloud server 32 via a network N, and in some embodiments, such that the transceiver 30 can communicate with an alert device 34, for example, when the alert device 34 is separate from the doorbell device 20 and/or includes a mobile device of an authorized user of the region R.

FIG. 3 is a flow diagram of a method 100 in accordance with disclosed embodiments. As seen in FIG. 3, when the camera 24 is activated, for example, by the proximity detector 26, the method 100 can include the camera 24 capturing an image of an ambient object within a field of view of the camera 24, as in 102. Then, the method 100 can include the processor 22 and/or the cloud server 32 processing the image with an artificial intelligence model to identify and extract details of features of the ambient object, as in 104, and initiating a broadcast of an audio or visual alert by the alert device 28 or 34, as in 106.
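The flow of the method 100 can be sketched, purely as an illustration, with three caller-supplied functions standing in for the camera 24, the artificial intelligence model, and the alert device 28 or 34. The function `run_method_100`, its parameters, and the message format are hypothetical; the comments map each line to the corresponding step of FIG. 3.

```python
from typing import Callable, Dict, Optional


def run_method_100(
    capture: Callable[[], Optional[bytes]],
    extract: Callable[[bytes], Dict[str, str]],
    broadcast: Callable[[str], None],
) -> Optional[str]:
    """Capture an image, extract details, and broadcast an alert."""
    image = capture()                 # as in 102
    if image is None:
        # The camera was not activated, so there is nothing to process.
        return None
    details = extract(image)          # as in 104
    message = "Detected: " + ", ".join(
        f"{name}: {value}" for name, value in details.items()
    )
    broadcast(message)                # as in 106
    return message
```

Passing the steps in as functions keeps the sketch agnostic as to whether the processing runs on the processor 22 or the cloud server 32.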

Although a few embodiments have been described in detail above, other modifications are possible. For example, the logic flows described above do not require the particular order described or sequential order to achieve desirable results. Other steps may be provided, steps may be eliminated from the described flows, and other components may be added to or removed from the described systems. Other embodiments may be within the scope of the invention.

From the foregoing, it will be observed that numerous variations and modifications may be effected without departing from the spirit and scope of the invention. It is to be understood that no limitation with respect to the specific system or method described herein is intended or should be inferred. It is, of course, intended to cover all such modifications as fall within the spirit and scope of the invention. 

1-20. (canceled)
 21. A device comprising: a camera that, when activated, captures an image of a person within a field of view of the camera; and a processor that (1) receives the image from the camera, (2) processes the image with an artificial intelligence model to identify and extract details of one or more visual features associated with the person, and (3) initiates a broadcast of an audio alert by an alert device associated with the camera, wherein the audio alert includes an audio recitation of the details of the one or more visual features associated with the person.
 22. The device of claim 21 wherein the alert device includes an audio device within a region outside of which the device is located.
 23. The device of claim 21 wherein the alert device includes a mobile device of an authorized user of a region outside of which the device is located.
 24. The device of claim 21 further comprising: a proximity detector that activates the camera when the person is detected within a detection range of the proximity detector.
 25. The device of claim 21 wherein the processor processes the image to determine whether the person is a known object or a generic object, and wherein the processor initiates the audio or visual alert in response to classifying the person as the generic object.
 26. The device of claim 21, wherein the one or more visual features associated with the person comprise one or more of: a height of the person, an eye color of the person, a description of clothing worn by the person, a physical build of the person, or one or more textual indicators on the person or clothing worn by the person.
 27. The device of claim 21 further comprising a transceiver, wherein the processor initiates the broadcast of the audio alert by the transceiver and to, and for output by, the alert device associated with the camera.
 28. A system comprising: a device; a camera of the device that, when activated, captures an image of a person within a field of view of the camera; and a cloud server remote from the device that (1) receives the image from the camera, (2) processes the image with an artificial intelligence model to identify and extract details of one or more visual features associated with the person, and (3) initiates a broadcast of an audio alert by an alert device associated with the camera, wherein the audio alert includes an audio recitation of the details of the one or more visual features associated with the person.
 29. The system of claim 28 wherein the alert device includes an audio device within a region outside of which the device is located.
 30. The system of claim 28 wherein the alert device includes a mobile device of an authorized user of a region outside of which the device is located.
 31. The system of claim 28 further comprising: a proximity detector that activates the camera when the person is detected within a detection range of the proximity detector.
 32. The system of claim 28 wherein the cloud server processes the image to determine whether the person is a known object or a generic object, and wherein the cloud server initiates the audio or visual alert in response to classifying the person as the generic object.
 33. A method comprising: when a camera of a device is activated, capturing an image of a person within a field of view of the camera; processing the image with an artificial intelligence model to identify and extract details of one or more visual features associated with the person; and initiating a broadcast of an audio alert by an alert device associated with the camera, wherein the audio alert includes an audio recitation of the details of the one or more visual features associated with the person.
 34. The method of claim 33 wherein the alert device includes an audio device within a region outside of which the device is located.
 35. The method of claim 33 wherein the alert device includes a mobile device of an authorized user of a region outside of which the device is located.
 36. The method of claim 33 further comprising: activating the camera when the person is detected within a detection range of a proximity detector.
 37. The method of claim 33 further comprising: determining whether the person is a known object or a generic object; and in response to determining that the person is the generic object, initiating the broadcast of the audio or visual alert.
 38. The method of claim 33 further comprising: processing the image with the artificial intelligence model at the device.
 39. The method of claim 33 further comprising: processing the image with the artificial intelligence model at a cloud server that is remote from the device. 